BIOTECHNOLOGY Jgfl t- '3 



RAW SEQUENCE LISTING 
ERROR REPORT 




The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 

Application Serial Number: 

Source . _J ^^(J^D * - 

-Date Processed b^STIC 3 J ( [p RS V 

THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION AND PATEN UN SOFTWARE QUESTIONS, PLEASE CONTACT 
MA RK SPENCER, TELEPHONE: 703-308-4 2 12; FAX: 703-308-4221 
Effective 12/1 VP J : TELEPHONE: 57 1 -272-25 10; FAX: 571-273-0221 

JO R KDUCK EHRORmj Sf-QUf; NCI- LIS TINGS, PLEASE USi-: Till:: CM ECKEK 
VERSION 4.1 PROGRAM , ACCESSim.l-: THROUGH Till: U S PATEN T AND 
TRADEMARK OFFICE WEBSITE SEE BELOW FOR ADDRESS 

http:/Avw\v.uspto.govAvcb/on"ice.s/nnc/chcckcr/chkr4 1 note, li tin 

0^ **** 

Applicants submitting genetic sequence 1 iilon n;it 10 11 elecironi«*Wr 011 diskWlc or CD-Rom should be auaic th.n the 

a possibility (hat the disk/CD-Rom may have been affected by treatment given to all incoming mail 

Please consider using alternate methods of submission for the disk/CO-Rom or replacement disk/CD-Rom 

Any reply including a sequence listing ui electron ic form should NOT be sen t 1 0 the 202 3 1 z ip c od c a d Id 1 ess lor the 

United States Patent and Trademark Office, and in stea d should be sent via the fo llowing t o the indicat ed ; 1 eld re s se s 

1. LTS-Bio (<h 1 1 iv./Avww. us pto.gov/c!)c7efsAlowri I ondsAlocnm en ts.ht in> , EFS Submission 

User Manual - cPAVE) 
2 U S. Postal Service: Commissioner for Patents P O Box 1450, Alexandria, VA 22313 1450 

3. Hand Can y directly to (EFFECTIVE 12/01/03): 

U.S. Patent and Trademark Office, Box Sequence, Customer Window, Lobby, Room IB*03, Crystal Pla/,i Two. 
201 1 South Clark Place, Arlington, VA 22202 

4. Federal Express, United Parcel Service, or-olhcr delivery service to: U.S. Patent and Trademark Office. 
Box Sequence. Room M303-Mailrooin, Crystal Pla^i Two, 201 1 South Clark Place, Arlington, VA 22302 



Revised 10/08/03 



Raw Sequence Listing Error Summary 



ERROR DETECTED SUGGESTED CORRECTION SERIAL NUMBER:. 

ATTN: N^w RULES CASES: PLEASE DISCARD ENGLISH "ALPHA** IIEADEl£ WHICH WERE INSERTED BY PTO SOFTWARE 



-Wrapped Nucleics The number/text at the end of each line "wrapped" down to the next line. This may occur if your file 
Wrapped Aminos was relieved in a word processor after creating it. Please adjust your right margin to 3" this will 
prevent "wrapping." ' 

.Invalid Line Length The rules require that a line not exceed 72 characters in length. This includes white spaces. 

.Misaligned Amino The numbering under each 5 ,h amino acid is misaligned. Do not use tab codes between numbers 



Numbering 
_Non-ASCII 

.Variable Length 



_PatentIn 2.0 
"bug" 



Skipped Sequences 
(OLD RULES) 



use space characters, instead 

The submitted file was not saved in ASCII(DOS) text, as required by the Sequence Rules Please 
ensure your subsequent submission is saved in ASCII text. 

Sequence(s) contain n's or Xaa's representing more than one i^sidue. Per Sequence Rules 

each n or Xaa can only represent a single residue. Please present the maximum number of each 
residue having variable length and indicate in the <220>-<223> section that some may be missing. 

A "bug" in Patentln version 2.0 has caused.the <220>-<223> section to be missing from amino acid 

sequences(s) . Normally, Patentln would automatically generate this section from the 

previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to 
the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for 
Artificial or Unknown sequences. 



Sequence(s) 



. missing. If intentional, please insert the following lines for each skipped sequence: 



10 



I 1 



12 



.Skipped Sequences 
(NEW RULES) 



.Use of n's or Xaa's 
(NEW RULES) 



.Invalid <213> 
Response 

Use of <220> 



(2) INFORMATION FOR SEQ ID NO.X: (insert SEQ ID NO where "X" is shown) 
f (l \ c^, S£ Q UENCE CHARACTERISTICS: (Do not insert any subheadings under this heading) 
(xi) SEQUENCE DESCRIPTION:SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 
This sequence is intentionally skipped 

Please also adjust the "(ii) NUMBER OF SEQUENCES:" response to include the skipped sequences. 

Scquence(s) missing. If intentional, please insert the following lines for each skipped sequence 

<2 1 0> sequence id number M 
<400> sequence" id number 
000 

Use of n's and/or Xaa's have been detected in the Sequence Listing. 

Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa's are present 

In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents. 

Per 1.823 of Sequence Rules, the only valid <2I3> responses are: Unknown, Artificial Sequence or 
scientific name (Genus/species). <220>-<223> section is required when <2I 3> response is Unknown or 
is Artificial Sequence 



. missing the <220> "Feature" and associated numeric identifiers and responses. 



13 



_>atentln 2.0 
"bug" 

»• - 
. Misuse of n/Xaa 



Sequence(s) 

it c _->™ — T . . ~ - -■ —w*,.«»w« iiuinu iw iut.uu i icrs ana responses. 

Use of <220> to <223> ,s MANDATORY if <2I3> "Organism" responses "Artificial Sequence" or 

Unknown. Please explain source of genetic material in <?20> to <223> section 
(See "Federal Register," 0t5T0 1/1 998, Vol. 63, No, 104, pp. 2963^32) (Sec. 1.823 of Sequence Rules) 

Please do not use "Copy to Disk" function of Patentln version 2.0. This causes a corrupted file - 
resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence 
listing). Instead, please use 'Tile Manager" or any other manual means to copy file to floppy disk. 

"n" can only represent a single nucleotide : "Xaa" can only represent a single amino acid 

AMC - Biotechnology Systems Branch - 09/09/2003 * 
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IFWO 



RAW SEQUENCE LISTING DATE : 03/16/2004 

PATENT APPLICATION: US/10/658 , 355A TIME : 07:58:57 

Input Set : D:\923seq.002 

Output Set: N:\CRF4\03162004\J658355A.raw 

3 <110> APPLICANT: Gantier, Rene 

4 Guyon, Thierry 

5 Hugo, Cruz Ramos 

6 Vega, Manuel 

9 <120> ?SS a S i iwEN?iON: Rational Directed Protein Evolution Using Two Dimensional 

10 Rational 

11 Mutagenesis Scanning 

13 <130> FILE REFERENCE: 38751-923 

15 <140> CURRENT APPLICATION NUMBER: US/10/658 , 355A 

16 <141> CURRENT FILING DATE: 2003-09-08 QoeS, Hot Comply 

18 <150> PRIOR APPLICATION NUMBER: 60/457, 063 ~. y ,;,.,, V^i^V^r*. fcU?f*ri©' ' 

19 <151> PRIOR FILING DATE: 2003-03-21 * ■ ' , _^ / A \ 

21 <150> PRIOR APPLICATION NUMBER: 60/410, 258 J^U \ KJ? J 

22 <151> PRIOR FILING DATE: 2002-09-09 / *~ J 

24 <160> NUMBER OF SEQ ID NOS : 501 

25 <170> SOFTWARE: FastSEQ for Windows Version 4.0 JL~^" 
ERRORED SEQUENCES A/V^ \ 



7445 <210> SEQ ID NO: 182 J.^jY^ 



7446 <211> LENGTH: 55 // ^ ^ <2L- c\ 

7447 <212> TYPE: DNA " \ >TX ^ <VN<> ) 

7448 <213> ORGANISM: Artificial Sequence fc> 

74 50 <22 0> FEATURE: Qy ^> 

7451 <223> OTHER INFORMATION: EcoRI Forward Primer ^> 
7453 <400> SEQUENCE: 182 5^ 
E— > 7454 gcctetatga tttattggat gttggaattc cctgatgcgg tattttctcc ttacg ^ 

7455 <H>— ^ — 7 

7457 <210> SEQ ID NO: 183 \ f)^ YV [ 

7458 <211> LENGTH: 55 . . / (> 

7 459 <212> TYPE: DNA ' 

7460 <213> ORGANISM: Artificial Sequence 
7 4 62 <220> FEATURE: 

7463 <223> OTHER INFORMATION: EcoRI Reverse Primer 

7465 <400> SEQUENCE: 183 ^ 
E— > 7466 ca^ggaga aaa taccgca tcagggaatt ccaacatc ca ataaatcatacaggc £j £ 

7467 " ' — " * r^f- 

7469 <210> SEQ ID NO: 184 ffltfrll 

7470 <211> LENGTH: 35 /\J)f / 0( 

7471 <212> TYPE: DNA 

7472 <213> ORGANISM: Artificial Sequence 



file://C:\CRF4\Outhold\VsrJ658355A.htm 
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RAW SEQUENCE LISTING DATE : 03/16/2004 

PATENT APPLICATION: US/10/658 , 355A TIME : 07:58:58 

Input Set : D:\923seq.002 (^/L^^ 
Output Set: N:\CRF4\03162004\J658355A.raw -> ^rryt 

1414 <220> FEATURE: 

7475 <223> OTHER INFORMATION: Seq Clal Forward Primer 
7477 <400> SEQUENCE: 184 
E > 7478 ctgfrtejtatca accgcggtac atatgattga catgc 

7479 ( 35 y V 

7481 <ZTtf> SEQ ID NO: 185 

7482 <211> LENGTH: 31 

7483 <212> TYPE: DNA 

7484 <213> ORGANISM: Artificial Sequence 

7486 <220> FEATURE: 

7487 <223> OTHER INFORMATION: Seq Clal Reverse Primer 
7489 <400> SEQUENCE: 185 

E > 7490 tao^ggataa taccgcgcca catagcagaa c 

7 4 91 ("31.^- " — "~ 

7493 <?T0> SEQ ID NO: 186 

7494 <211> LENGTH: 18 

7495 <212> TYPE: DNA 

7496 <213> ORGANISM: Artificial Sequence 

7498 <220> FEATURE: 

7499 <223> OTHER INFORMATION: Seq Forward Primer 
7501 <400> SEQUENCE: 186 _^ 

E — > 7502 cctgatgaag gaggactc ^ 

7503 M) _ .„ ... 

7505 <ZT0> SEQ ID NO: 187 

7506 <211> LENGTH: 18 

7507 <212> TYPE: DNA 

7508 <213> ORGANISM: Artificial Sequence 
7 510 <22 0> FEATURE: 

7511 <223> OTHER INFORMATION: Seq Reverse Primer 
7513 <400> SEQUENCE: 187 



E — > 7514 ccaagcagca gatgagtc 

7 515 /18> " 

7517 <2T0> SEQ ID NO: 188 

7518 <211> LENGTH: 31 

7519 <212> TYPE: DNA 

7520 <213> ORGANISM: Artificial Sequence 

7522 <220> FEATURE: 

7523 <223> OTHER INFORMATION: I FN alpha-2b 5' Primer 
7525 <400> SEQUENCE: 188 

E — > 7526 tcppgptgcaa gtcaagctgc tctgtgggct g ^ _ 

7 527 (31;' - — ~ _ ~ 

7529 <2Tt)> SEQ ID NO: 189 

7530 <211> LENGTH: 48 

7531 <212> TYPE: DNA 

7532 <213> ORGANISM: Artificial Sequence 

7534 <220> FEATURE: 

7535 <223> OTHER INFORMATION: I FN alpha-2b 3 1 Primer 
7537 <400> SEQUENCE: 189 

E— > 7538 gctctagatc attccttact tcttaaactt tcttgcaagt ttgttgac 
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RAW SEQUENCE LISTING DATE : 03/16/2004 

PATENT APPLICATION: US/10/658 , 355A TIME: 07:58:58 



Output Set: N:\CRF4\03162004\j658355A.raw 



2fW 




Input Set : D:\923seq.002 

7539 48 

7541 <210> SEQ ID NO: 190 

7542 <211> LENGTH: 36 

7543 <212> TYPE: DNA 

7544 <213> ORGANISM: Artificial Sequence 

7546 <220> FEATURE: 

7547 <223> OTHER INFORMATION: I FN alpha-2b Hindlll Primer 

754 9 <400> SEQUENCE: 190 _ ^> & 

E — > 7550 ccograjgctta tggccttgac ctttgcttta^ctggtg_^^/ 



7551 (3£/^ 



7553 <210> SEQ ID NO: 191 

7554 <211> LENGTH: 48 

7555 <212> TYPE: DNA 

7556 <213> ORGANISM: Artificial Sequence 

7558 <220> FEATURE: 

7559 <223> OTHER INFORMATION: I FN alpha-2b Xbal Primer 
7561 <400> SEQUENCE: 191 

E— > 7562 gc£ptagatc attccttact tcttaaactt tcttgcaagt ttgttgac O 

75 63 / f 48) ^- y 

7565 ^lt)> SEQ ID NO: 192 

7566 <211> LENGTH: 80 

7567 <212> TYPE: DNA 

7568 <213> ORGANISM: Artificial Sequence 

7570 <220> FEATURE: 

7571 <223> OTHER INFORMATION: I FN alpha-2b 80 bp 5' Primer 

7573 <400> SEQUENCE: 192 ✓ X) 

E— > 7574 cccaagctta tggccttgac ctttgcttta ctggtggccc tcctggtgc^agc^ U 

E— > 7575 (80>- _ - £*Cj 

E — > 7576 tcaagctgct ctgtgggctg 

7578 <210> SEQ ID NO: 193 

7579 <211> LENGTH: 20 

7580 <212> TYPE: DNA 

7581 <213> ORGANISM: Artificial Sequence 

7583 <220> FEATURE: 

7584 <223> OTHER INFORMATION: EMCV Forward Primer 

7586 <400> SEQUENCE: 193 - 
E — > 7587 cccjc^acatt gaggcatcca _ " 7 

7588 ("'2b ) 

7590 <210> SEQ ID NO: 194 

7591 <211> LENGTH: 21 

7592 <212> TYPE: DNA 

7593 <213> ORGANISM: Artificial Sequence 

7595 <220> FEATURE: 

7596 <223> OTHER INFORMATION: EMCV Reverse Primer 
7598 <400> SEQUENCE: 194 

E — > 7599 cagg^gcagg acaaggtcac t 

7 600 — 

7602 <210> SEQ ID NO: 195 

7603 <211> LENGTH: 22 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/10/658 , 355A 



DATE: 03/16/2004 
TIME: 07:58:58 



Input Set : D:\923seq.002 

Output Set: N:\CRF4\03162004\J658355A.raw 



7604 
7605 
7607 
7608 
7609 
7610 
7611 
7612 
7614 
E — > 7615 
7616 
8074 
8075 
8076 
8077 
8079 
8080 
8082 
E — > 8083 
8084 
8086 
8087 
8088 
8089 
8091 
8092 
8094 
E — > 8095 
8096 
8098 
8099 
8100 
8101 
8103 
8104 
8106 
E— > 8107 
8108 
8110 
8111 
8112 
8113 
8115 
8116 
8118 
E — > 8119 



<212> TYPE: DNA 
<213> ORGANISM: Artificial Sequence 
<22 0> FEATURE: 

<221> NAME /KEY : misc_f eature 
<222> LOCATION: 1; 22 

<223> OTHER INFORMATION: EMCV Probe: n at position 1 is a 5-carboxyf luorescem 
(FAM) -modified deoxycyt idylate ; n at position 22 is an N,N,N',N'- 
tetramethyl-6-carboxyrhodamine (TAMRA) -modified deoxythymidylate . 
<400> SEQUENCE: 195 

rtca agacccaacc gen 

<2Tt)> SEQ ID NO: 208 
<211> LENGTH: 41 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Oligonucleotide 
<400> SEQUENCE: 208 

aacatatgtg tgatctgect caaacccaca gectgggtag c 

<210> SEQ ID NO: 209 
<211> LENGTH: 46 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Oligonucleotide 
<400> SEQUENCE: 209 

aagg^tcctc attccttact tcttaaactt tettgeaagt ttgttg 

<2TD> SEQ ID NO: 210 
<211> LENGTH: 41 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<22 0> FEATURE: 

<223> OTHER INFORMATION: Oligonucleotide 
<400> SEQUENCE: 210 

aacl*£atgtg tgatctgect caaacccaca gectgggtag c 



211 



8120 
1907 
1907 



> SEQ ID NO: 
<211> LENGTH: 46 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Oligonucleotide 
<400> SEQUENCE: 211 

aagga^cctc attccttact tcttaaactt tettgeaagt ttgttg 

2 Vzl0> SEQ ID NO: 500 

3 <211> LENGTH: 46 



file://C:\CRF4\Outhold\VsrJ658355A.htm 



3/16/04 



Page 5 of 7 




RAW SEQUENCE LISTING DATE: 03/16/2004 

PATENT APPLICATION: US/10/658 , 355A TIME: 07:58:59 

Input Set : D:\923seq.002 

Output Set: N:\CRF4\03l62004\J658355A.raw 

19074 <212> TYPE: DNA 

19075 <213> ORGANISM: Artificial Sequence 

19077 <220> FEATURE: 

19078 <223> OTHER INFORMATION: primer reverse INFA-E159H 
19080 <400> SEQUENCE: 500 

E — > 19081 yRfcggatcctc attccttact tcttaaactg tgttgcaagt ttgttg 

1908^746/ 

19084 s ^210> SEQ ID NO: 501 

19085 <211> LENGTH: 46 

19086 <212> TYPE: DNA 

19087 <213> ORGANISM: Artificial Sequence 

19089 <220> FEATURE: 

19090 <223> OTHER INFORMATION: primer reverse INFA-E159Q 
19092 <400> SEQUENCE: 501 

E — > 19093 aaqgEKjtcctc attccttact tcttaaactc tgttgcaagt ttgttg 

19094 $6 
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VERIFICATION SUMMARY DATE: 03/16/2004 

PATENT APPLICATION: US/10/658 , 355A TIME: 07:59:00 



Input Set : D:\923seq.002 

Output Set: N:\CRF4\03162004\J658355A.raw 



L: 


7454 


M: 


254 




No. 


of 


Bases 


conflict , 


LENGTH : 


Input : 


0 


Counted: 


55 


SEQ: 


1 oz 


L: 


7466 


M: 


254" 


JE-i 


No. 


of 


Bases 


conflict, 


LENGTH: 


Input : 


0 


Counted: 


55 


SEQ: 


18 3 


L: 


7478 


M: 


254 


E: 


No. 


of 


Bases 


conflict , 


LENGTH: 


Input : 


0 


Counted : 


35 


SEQ : 


184 


L: 


7490 


M : 


25 4" 


E: 


No. 


of 


Bases 


conflict, 


LENGTH: 


Input : 


0 


Counted: 


31 


SEQ: 


18 5 : 


L: 


7502 


M: 


254 


• E: 


No. 


of 


Bases 


conflict , 


LENGTH: 


Input : 


0 


Counted: 


18 


SEQ: 


186 


L: 


7514 


M: 


254 


E: 


No. 


of 


Bases 


conflict , 


LENGTH: 


Input : 


0 


Counted: 


18 


SEQ: 


18 7 


L: 


7526 


M: 


254 


E: 


No. 


of 


Bases 


conflict, 


LENGTH: 


Input : 


0 


Counted: 


31 


SEQ: 


188 - 


L: 


7538 


M: 


254 


E: 


No. 


of 


Bases 


conflict , 


LENGTH: 


Input : 


0 


Counted: 


48 


SEQ: 


189, 


L: 


7550 


M: 


254 


E: 


No. 


of 


Bases 


conflict, 


LENGTH: 


Input : 


0 


Counted : 


36 


SEQ: 


190 '" 


L: 


7562 


M: 


254 


E: 


No. 


of 


Bases 


conflict, 


LENGTH : 


Input : 


0 


Counted: 


48 


SEQ: 


191 -r 


L: 


7574 


M: 


254 


E: 


No. 


of 


Bases 


conflict, 


LENGTH: 


Input : 


0 


Counted: 


60 


SEQ: 


192 


M: 


254 


Repeated 


in SeqNo=192 
















193 - 


L: 


7587 


M: 


254 


E: 


No. 


of 


Bases 


conflict , 


LENGTH: 


Input : 


0 


Counted: 


20 


SEQ: 


L: 


;7599 


M : 


254 


E: 


No. 


of 


Bases 


conflict, 


LENGTH: 


Input : 


0 


Counted: 


21 


SEQ: 


194 


L: 


;7615 


M: 


: 341 


W: 


(46) 


"n" or 


"Xaa" used, 


for SEQ ID#: 


195 after 


pos . : 0 





L:7615 M:254 E: No. of Bases conflict, LENGTH : Input : 0 Counted:22 SEQ:195 .. 
L:8083 M: 254 E : No. of Bases conflict, LENGTH : Input : 0 Counted:41 SEQ:208 
L:8095 M:254 E: No. of Bases conflict, LENGTH : Input : 0 Counted:46 SEQ:209 , 
L-8107 M:254 E: No. of Bases conflict, LENGTH : Input : 0 Counted:41 SEQ:210 *>" 
L:8119 M:254 E: No. of Bases conflict, LENGTH : Input : 0 Counted:46 SEQ:211 ..... 
L-19081 M:254 E: No. of Bases conflict, LENGTH : Input : 0 Counted:46 SEQ:500_. 
L:19093 M:254 E: No. of Bases conflict, LENGTH : Input : 0 Counted:46 SEQ:501 
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